WHAT IS CLAIMED IS: 



1. For use in a processor having an at least four-wide 

2 instruction issue architecture^ a mechanism for pipeline processing 

3 multiply-accumulate instructions with out-of-order completion, 

4 comprising: 

5 a multiply-accumulate unit (MAC) having an initial multiply 

6 stage and a subsequent accumulate stage; and 

7 out-of-order completion logic, associated with said MAC, that 

8 causes interim results produced by said multiply stage to be stored 
90 when said accumulate stage is unavailable and allows younger 

lOO instructions to complete before said multiply-accumulate 

11^ instructions. 

p 2. The mechanism as recited in Claim 1 wherein said initial 

2r^ multiply stage and said subsequent accumulate stage are single 
3Ci clock cycle stages. 

3. The mechanism as recited in Claim 1 wherein said out-of- 

2 order completion logic is contained in a writeback stage of a 

3 pipeline in said processor. 

4. The mechanism as recited in Claim 1 wherein said out-of- 
2 order completion logic writes back said interim results to at least 
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one register in said MAC before said multiply-accumulate 
instructions arrive at said accumulation stage of said MAC. 

5. The mechanism as recited in Claim 1 wherein said interim 
results are unavailable to an external program executing in said 
processor. 

6. The mechanism as recited in Claim 1 wherein grouping 
logic within said processor groups said multiply-accumulate 
instructions based on said mechanism. 

7 . The mechanism as recited in Claim 1 wherein said 
processor is a digital signal processor. 



8. For use in a processor having an at least four-wide 
instruction issue architecture, a method of pipeline processing 
multiply-accumulate instructions with out-of-order completion, 
comprising: 

providing a multiply-accumulate unit (MAC) having an initial 
multiply stage and a subsequent accumulate stage; 

causing interim results produced by said multiply stage to be 
stored when said accumulate stage is unavailable; and 

allowing younger instructions to complete before said 
multiply-accumulate instructions . 

9. The method as recited in Claim 8 wherein said initial 
multiply stage and said subsequent accumulate stage are single 
clock cycle stages. 

10. The method as recited in Claim 8 wherein said causing is 
carried out in a writeback stage of a pipeline in said processor. 

11. The method as recited in Claim 8 wherein said causing 
comprises writing back said interim results to at least one 
register in said MAC before said multiply-accumulate instructions 
arrive at said accumulation stage of said MAC. 



12. The method as recited in Claim 8 wherein said interim 



results are unavailable to an external program executing in said 
processor. 

13. The method as recited in Claim 8 further comprising 
grouping said multiply-accumulate instructions based on said 
mechanism. 

14. The method as recited in Claim 8 wherein said processor 
is a digital signal processor. 



15. A digital signal processor (DSP), comprising: 

2 a pipeline having stages and capable of processing multiply- 

3 accumulate instructions; 

4 an instruction issue unit containing grouping logic and at 

5 least four-wide instruction issue logic; 

6 a multiply-accumulate unit (MAC), coupled to said instruction 

7 issue logic, having an initial multiply stage and a subsequent 

8 accumulate stage; and 

9 out-of-order completion logic, associated with said pipeline, 
10 that causes interim results produced by said multiply stage to be 
11q stored when said accumulate stage is unavailable and allows younger 
12p instructions to complete before said multiply-accumulate 
13=p instructions. 

m 

16. The DSP as recited in Claim 15 wherein said initial 
2|-^ife multiply stage and said subsequent accumulate stage are single 
33 clock cycle stages. 

XSts&t 

17. The DSP as recited in Claim 15 wherein said out-of-order 

2 completion logic is contained in a writeback stage of said 

3 pipeline. 

18. The DSP as recited in Claim 15 wherein said out-of-order 
2 completion logic writes back said interim results to at least one 
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register in said MAC before said multiply-accumulate instructions 
arrive at said accumulation stage of said MAC, 

19. The DSP as recited in Claim 15 wherein said interim 
results are unavailable to an external program executing in said 
DSP. 

20. The DSP as recited in Claim 15 wherein said grouping 
logic groups said multiply-accumulate instructions based on said 
mechanism. 



